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ABSTRACT 

We measure spectral features of —3500 i-C-sclccted galaxies at 0.5 < z < 2.0 from high quality 
medium-band photometry using a new technique. First, we divide the galaxy sample in 32 subsamplcs 
based on the similarities between the full spectral energy distributions (SEDs) of the galaxies. For 
each of these 32 galaxy types we construct a composite SED by de-redshifting and scaling the observed 
photometry. This approach increases the signal-to-noise ratio and sampling of galaxy SEDs and allows 
for model-independent stellar population studies. The composite SEDs are of spectroscopic quality, 
and facilitate - for the first time - Ha measurement for a large magnitude-limited sample of distant 
galaxies. The linewidths indicate a photometric redshift uncertainty of Az < 0.02 x (1 + z). The 
composite SEDs also show the Balmer and 4000 A breaks, Mgll absorption at —2800 A, the dust 
absorption feature at 2175 A, and blended [OiiiJ+H/3 emission. We compare the total equivalent 
width of Ha, [Nil], and [Sn] (W H a+) with the strength of the 4000 A break (D(4000)) and the best- 
fit specific star formation rate, and find that all these properties are strongly correlated. This is a 
reassuring result, as currently most distant stellar population studies are based on just continuum 
emission. Furthermore, the relation between TFhq+ and _D(4000) provides interesting clues to the 
SFHs of galaxies, as these features are sensitive to different stellar ages. We find that the correlation 
between Wh q + and Z?(4000) at 0.5 < z < 2.0 is similar to z — 0, and that the suppression of star 
formation in galaxies at z < 2 is generally not abrupt, but a gradual process. 

Subject headings: galaxies: evolution — galaxies: stellar content 



1. INTRODUCTION 

Galaxies in the local universe give us many clues to 
their star formation histories (SFHs). Massive elliptical 
galaxies can have stellar ages nearly as old as the age of 
the universe, while spiral galaxies are still forming new 
stars. Detailed studies of nearby stellar populations al- 
low accurate age measurements, and thus a measure of 
the SFHs in these systems. While such detailed studies 
will never be possible at cosmological distances, studying 
galaxies throughout cosmic time offers another advan- 
tage: we can directly study their masses and star forma- 
tion rates (SFRs) at different phases in their lives, rather 
than inferring these quantities from age measurements. 

In the past few years, photometric and spectro- 
scopic galaxy surveys have started to explore the uni- 
verse at rcdshifts z = 1 — 3, the peak of the quasar 
era and the epoch when the volume-av eraged SFR of 
the U niverse was at its maximum (e.g, iBouwens et al.l 
120 111) . This peak is at least in part caused by a 
population of galaxies with apparently very high spe- 
cific SFR (S S FR; S FR divided by stellar mass) (e.g., 
Noeske et all l2007allbl: iDavis et al.1 120071: iSalim et al 



20071 : lElbaz et al.ll2007HDamen et al. 11200a lol iver et al 



20101 : iPapovich et al.l 120 111) . Remarkably, at the same 
time there are galaxies that appear to have already 
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stopped forming stars. Massive, quiescent galaxies 
have been iden t ified o ut to at least z — 2.5 (e.g., 
McCarthy et~aTI l200l lLabbe et al.l 120051: IKriek et al.1 



2006bl I2008bt ICimatti et al.ll2008t IWilliams et al.l l200a 



Brammer et al.l 120091 IWhitaker et al.l 120101 ) . implying 
that they formed their stars at even earlier times. 

A major complication in all high-redshift studies is the 
lack of uniform and reliable measurements of SFHs. In 
the local universe, the "standard" indicator of the in- 
stant aneous SFR is the Ha emission line (e.g.. lKennicuttl 
119981 ) and a standard indicator of the age of a stel- 
lar population is th e strength of the 4000 A HK con- 
tinuum break (e.g.. iKauffmann et al.l 12003 al ) . At rcd- 
shifts z > 1, both features shift into the observed 
near-infrared (NIR) and are very difficult to measure. 
As a result, studies of galaxies at z > 1 are ei- 
ther limited to s mall and often biased sampl e s (e.g. , 
Erb et all I2006allb1lcl: IKriek et al.1 l2006al [20071 l2008at 



Muzzin ct al. 2009, 2010: R cddv et al.ll2010l) or are based 



on SFH indicators that are less robust and less well cali- 
brated, such as broadband spectral energy distributions 
(SEDs) (e.g..lvan Dokkum et al.ll2006l: lLabbe et al.l l2007l: 
IPapovich et al.ll2007l 120111 : iDamen et al. 112009^ 

As a compromise between sample size and higher 
spectral resolution, we have un dertaken the NEWFIRM 
medium-band survey (NMBS; Ivan Dokkum et all 12001 
Whitaker et all IMTlh in the COSM OS (IScoville et atl 



2007D and AEGIS (jPavis et al.ll2007l ) fields. The "high" 



resolution NIR photometry in combination with the pub- 
lic data available in these fields provides accurate pho- 
tometric redshifts and stellar population properties out 
to z — 3 (iBrammer et al.ll2009l: Ivan Dokkum et al1l2010l: 
IWhitaker et al.ll2010b IKriek et al.ll2010l:lMarchesini et al.l 
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[20101: iWake et all |20TT1 ; iBrammer etHI l20ll . In the 

present work we identify groups of analogous galaxies 
at 0.5 < z < 2.0 and construct composite SEDs. This 
approach increases the signal-to-noise and sampling of 
galaxy SEDs and allows for model-independent stellar 
population studies. The SEDs are of exquisite quality, 
showing spectral features, such as the Ha+[N n] + [S 11] 
and H/?+[Oiii] emission lines, the Balmer or 4000 A 
breaks, Mg 11 absorption at 2800 A, the continuum break 
at 2640 A, and the dust absorption feature at 2175 A. 
Thus, these SEDs can be used to study the stellar popu- 
lations and emission line characteristics of typical galax- 
ies in a very detailed way, which is not possible when 
the photometry of individual galaxies is considered sep- 
arately. 

This first paper describes our technique, provides an 
overview of all composite SEDs and example rest-frame 
optical morphologies, and studies the correlations be- 
tween the equivalent width (EW) of Ha, the strength 
of the 4000 A break, and the SSFRs of - 3500 galaxies 
at 0.5 < z < 2.0. Additionally, we compare our measure- 
ments to those of low-redshift galaxies from the SDSS 
and stellar population synthesis (SPS) models, and dis- 
cuss the implications for the SFHs of galaxies. Other 
applications of the composite spectra, including the rela- 
tion between galaxy size and SED type, dust properties, 
and the demographics of active galactic nuclei (AGNs) 
will be discussed in future papers. 
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Figure 1. Top: The 22 synthetic rest- frame filters used to identify 
galaxies with similar SED shapes. Bottom: SEDs and 22 rest-frame 
colors for four different galaxy types. This panel illustrates that 
using these 22 bands we can discriminate between different types 
and find galaxies with similar SED shapes and thus properties. For 
example, the value for b between the two closest SEDs in this figure 
(the dusty star-forming in purple and old galaxy in red) is 0.23. 



using the following expression: 



2. CONSTRUCTING COMPOSITE SEDS 

The galaxy sample used to construct the compos- 
ite SEDs is selected from the NMBS. The NMBS 
uses five custom NIR filters and covers a total area 
of 0. 4 square degree s in the COSMOS (IScoville et al.l 
[2001 and AEGIS (iDavis et al.l [200l fields. The 
medium-band filter set is combined with p ublicly avail- 
able i maging at FUV-to-MIR wa v elengths (|Erben et all 
2009t iHildebrandt et all [200l iBarmbv et al.l 120081: 



Sanders et all 120071: iCapak et al.l [2007F as described in 
Whitaker et al.l ( 20111 ). For the work presented in this 



paper, we limit our sample to the COSMOS field, as 
medium-band optical photometr y from Subaru is avail- 
able for this field as well (e.g., iTaniguchi et al.l 120071 : 
lllbert et al.l 120091 Y. Taniguchi et al. in prepara- 
tion). The high photometric sampling provides accu- 
rate phot ometric redshifts and stellar population proper- 
ties (e.g.. IBrammer et aLll2009l : Ivan Dokkum et al.ll201d : 
IWhitaker et al.l 120101 ) . The photometric redshifts and 
stellar population pro perties are d erived using EAZY 
(|Brammer et al.l 12001) and FAST (jKriek et alj l2009al ). 
respectively. In order to limit our sample to high-quality 
SEDs with rest-frame UV-to-NIR wavelength coverage, 
we require a redshift of 0.5 < z < 2.0 and a signal-to- 
noise (S/N) of 25 in the K-band. 

We define 22 artificial rest-frame filters of intermediate 
bandwidth, evenly spaced in log A between 1200 A and 
50 000 A, and compute the fluxes in these filters for all 
galaxies in our sampl e, using the method described by 
IBrammer et al.l (|2009f) . In Figure Q] we show the filters 
in comparison to the SEDs of different galaxy types. For 
each galaxy in our sample we identify analogous galaxies, 



/E (/° bl 



n fob2^2 



with a the scaling factor 



an 



Yj y°bt 



(2) 



and b the variation between two galaxies. In practice, 
we generally use fewer than 22 rest-frame filters to relate 
galaxies, as we only compare the rest-frame wavelength 
coverage that two galaxies have in common. 

We start by taking b < 0.05 and identify the galaxy 
with most analogs; this is the primary galaxy for the 
first subsample. We choose this value of b as it yields 
distinct composite SEDs, while keeping the intrinsic scat- 
ter per SED type low. By decreasing the value for b we 
found identical best-fit stellar population models for sim- 
ilar SED types, while for larger values of b the SEDs of 
individual "analog" galaxies were not well matched. 

Next, we remove the primary galaxy and all its analogs 
from the parent sample and again find the galaxy with 
the largest number of analogs. We repeat this proce- 
dure until the primary galaxy has fewer than 19 analogs. 
In total we find 32 subsamples with at least 20 galaxies, 
which include 83% of the total galaxy sample. Some ana- 
log galaxies that have been removed in one of the first 
few iterations might have been a better match with an- 
other galaxy type which has been constructed during a 
later iteration. Thus, after having divided the galaxies 
into subsamples, we reassign 15% of the galaxies which 
are a better match with a primary galaxy of another sub- 



Ha and 4000 A Breaks for ^3500 Distant Galaxies 



3 



1.0 

0.5 
0.0 
1.0 

0.5 
0.0 
1.0 

0.5 
0.0 
1.0 

0.5 
0.0 



Observed 
fluxes 



De-redshifted 
observed 
fluxes 



□ • 



Scaled 
rest-frame 
B ■ fluxes 
\ 

a 




Observed 
fluxes of all 
analogs 



Observed photometry: 
Primary galaxy (z = 0.74) 

□ Analog galaxy (z-1.24) 

Rest-frame photometry: 
• Primary galaxy 

□ Analog galaxy 



Rest-frame 
fluxes 



5 Scaled 
flPlg observed 
fluxes 



Composite 
/VA SED 

/ \ 



10000 



10000 



(Angstrom) 



(Angstrom) 



Figure 2. Illustration of our method to construct composite 
SEDs. In panel 1 we show the primary galaxy and one of its 
analogs. We de-redshift both galaxies to the rest-frame (panel 2) 
and determine the fluxes for the artificial filters, as shown in Figure 
1 (panel 3). Next, we scale the analog spectrum to its primary us- 
ing the fluxes of the rest-frame filters they have in common (panel 
4). In panel 5 we show the combined scaled and de-redshifted ob- 
served photometry of the primary and analog galaxy. We do the 
same for all analogs, and obtain the combined photometry in panel 
6. Finally, we construct a composite SED by averaging the scaled, 
de-redshifted observed fluxes in wavelength bins. 



sample. For example, several galaxies in the subsample 
of type 3 move to type 2 or 4 (see Section [3]) . 

We create a composite SED for each subsample, by 
de-redshifting all galaxies, and scaling the observed pho- 
tometry of the analogs to that of the primary galaxy. 
Our method is illustrated in Figure [2] Note, that we 
use observed photometry in the stacks, not the 22 rest- 
frame filters; the 22 filters are only used to find analogs. 
We construct the composite SEDs by averaging the rest- 
frame scaled fluxes of the primary and analog galaxies in 
wavelength bins. The number of data-points per wave- 
length bin depends on the number of analog galaxies. For 
each bin we calculate both the error on the mean (using 
bootstrap resampling) and the scatter in the distribu- 
tion. To determine the scatter we first derive and remove 
the running median from all combined photometric data 
points. Next, we derive the scatter within each bin using 
the residuals. Finally, we determine the effective filter 
curve for each bin by adding the de-redshifted, normal- 
ized (by total integrated transmission) filter curves of all 
included data points. 

As mentioned above, the composite SEDs include 83% 
of the total galaxy sample. In Figure [3] we show the par- 
ent sample and indicate the 83% that we use in the com- 
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Figure 3. Stellar mass vs. redshift for all 4235 galaxies in the 
NMBS-COSMOS field with 0.5 < z < 2.0 and a S/N of > 25 in the 
K-band. The 83% of the galaxies that are included in the 32 SED 
types are shown in black, while the remaining 17% are indicated 
by the gray symbols. In the inset we show the fraction of galaxies 
that is included as a function of redshift. In Appendix A we further 
discuss the completeness of our SED sample. 



posite SEDs as a function of stellar mass and redshift. 
The inset shows the completeness fraction as a function 
of redshift. The figure clearly illustrates that our sample 
is weighted towards lower redshift, with a median red- 
shift of z = 0.89. In Appendix A, we further discuss the 
completeness of the sample and what types of galaxies 
we may be missing. 

3. ANALYSIS 

The composite SEDs are presented in Figure 2] They 
are of exquisite quality, showing spectral features, such 
as the Ha+ [N n] + [S n] and H/3+ [O in] emission lines, the 
Balmer or 4000 A breaks, Mg n absorption at 2800 A, the 
continuum break at 2640 A, and the dust absorption fea- 
ture around 2175 A. In Figure [5] we indicate all these fea- 
ture in 3 different SED types: a blue star-forming galaxy 
type, a dusty star-forming galaxy type, and a quiescent 
galaxy type. In this section we will study the stellar pop- 
ulations of the different SED types using spectral indices 
and by comparing the full SED shapes with SPS models. 

3.1. SPS fitting 

In this section we derive stellar population properties 
by comparing the composite SEDs wi th SPS models. We 
fit the co mposite SEDs by both the iBruzual k, Charlotl 
((2001 and lMarastonl (120051) SPS models, assuming a de- 
layed exponential SFH of the form ip(t) oc texp(— t/r), 
and leaving age (t), the e- folding time (r), the amount 
of dust attenuation (Ay), and metallicity (Z) as free 
parameters (see no te to Table [5]). We assume the 
iCalzetti et all pOOOl ) attenuation curve, which we imple- 
ment as a uniform screen. We adopt a iSalpeteii (119551) 
initial mass function (IMF). Compared to a iKroupal 
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Figure 4. Left panels: Composite SEDs constructed from the NMBS photometry for galaxies at 0.5 < z < 2.0 with S/Nj{ > 25. The 
SEDs are ordered according to their D(4000), starting with t he SEP type with the str onge st 4000 A break. The dashed line indicates 
the location of Ha. The red and purple curves arc the best-fit Bruzual & Chariot (2003) and Maraston (2005) models, respectively. The 



value for D(4000) and Wjja+i the best-fit value for Av for the Bruzual fc Charlotl ( 120031 ) models, and the number of galaxy included in 
the composite SED are printed in each panel. Right panels: For each composite SED we show the i-band HST/ACS images of the nine 
galaxies closest in redshift to z = 0.9, thus probing A ~ 4000 A in rest-frame. 



(|200l or IChabrierl pOOl IMF our choice will primar- 
ily affect the mass-to-light ratio, and has little impact on 
other stellar population properties. We mask the bins 
that are possibly contaminated by Ha while fitting. For 
the photometric uncertainties we take the scatter, and 
thus our confidence intervals on the stellar population 
properties essentially reflect the variation within a galaxy 
subsample. 

The bes t fits for the iBruzual fc Charlotl pOOl and 
IMarastonl (|2005| ) SPS models are represented by the 
red and purple curves in Figure HJ respectively. The 
corresponding best-fit values and confidence intervals 
are given in Table 1. The reduced \ 2 values indi- 
cate that most SED types are equally well fit by both 



models, though the stellar population properties im- 
plied by t he fits are not always con s istent with each 
other (see IConrov et all pOOl l2010a| ): iConrov fc Gunr] 
pOlOt ) for a detailed discussion on the uncertainties in 
SPS models). The exception though are post-starburst 
galaxies, which are represented by SED t ype 6 (and 
to a lesser extent type 7). As shown in iKriek et al.l 
pOlOf) . the IMarastonl (|2005() models overestimate the 
contribution of the thermally-pulsing asymptotic giant 
branch (TP-AGB) st ars at NIR wavelengths (see also 
IConrov fc Gmm1l20ToT ). SED type 12 is poorly described 
by both models, though it is not clear why. These galax- 
ies might have been placed at the wrong redshift, result- 
ing in a SED shape which cannot be explained by any 
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Figure 5. Zoom on the spectroscopic features of three different 
SED types. For the quiescent type (red) we detect the 4000 A 
break, Mgn absorption at 2800 A, and the continuum break at 
2640 A. For the dusty star-forming type (purple) we detect the dust 
absorption feature at 2175 A, the Ha+ [N n] + [S II] emission line and 
possibly the H/3+[Oiii] emission line. For the less obscured star- 
forming type (blue) we detect the Balmer break, and the two sets 
of blended emission lines. 

SPS model. This will be explored in detail in I. Labbe 
ct al. (in preparation). 

Many SEDs arc poorly fit in the rest-frame UV. 
This is likely due to the presence of the dust bump at 
2175 A (IStecherl 119651 ). which is not included in the 
used attenuation model. Although this feature is seen 
in the Milky Way and several other galaxies (even at 
high redshift), the exact origin is still unkn own (e.g., 
IDraine fc MalhotralfTMl IConrov et al.ll2010b[ ). We will 
further investigate this in a future paper. 

3.2. L>(4000) 

The composite SEDs allow a direct measurement of 
the 4000 A break. The 4000 A break is due to ab- 
sorption in the atmospheres of stars and arises because 
of an accumulation of absorption lines of primarily ion- 
ized metals, among which Call H and K. As the opac- 
ity increases with decreasing stellar temperature, the 
4000 A break gets larger with older ages, and is largest 
for old and metal-rich stellar populations. The 4000 A 
break is generally quantified by D„(4000) (|Balogh et al.l 
Gill, which measures the ratio of the average flux den- 
sity F v in continuum regions blucward (3850-3950 A) 
and redward (4 000-4100 A) of the break, or by £>(4000) 
(jBruzuall H98l . which uses larger continuum regions 
around the break (3750-3950 A and 4050-4250 A). We 
measure Z?(4000) for all our composite SEDs, as it better 
matches the resolution of our composite SEDs. We de- 
termine the confidence intervals using Monte Carlo sim- 
ulations. For the photometric errors we take the error on 
the mean, as explained in Section [2j 



The break measurements are affected by the low resolu- 
tion of the medium bands and possible photometric red- 
shifts errors. We correct for the resolution as explained 
in detail in Appendix C. Summarized, we derive a cor- 
rection factor for each break measurement using the dif- 
ference between the break measurement for the original 
best-fit stellar population model and the best-fit model 
convolved by the composite filter curves (see Figure in 
Appendix c). The exact correction factor depends pri- 
marily on the strength of the break, but also on the sam- 
pling of the composite filters. The original and corrected 
values for I? (4000) are given in Tabic 1. 

Correcting for photometric redshift errors is more chal- 
lenging, as we do not know the uncertainties for the dif- 
ferent SED types. In appendix B we show the effects 
of the break measurement in case the median random 
scatter is Az/(1 + z) = 0.005, 0.01, 0.02, or 0.04. We 
do not apply this correction, but caution that the break 
measurements, in particular for large values of D(4000), 
are possibly underestimated. 

3.3. Ha emission line 

Remarkably, we see a bump at the location of Ha in 
most SEDs. In several composite SEDs we also see indi- 
cations for H/3 and/or [Om] near 0.5 /im. We measure 
the equivalent width (EW) of Ha+[Nll]+[S II] in the fol- 
lowing way. As the best stellar population fits to full SED 
do not always provide a good fit to the continuum region 
around Ha, we refit the SED restricting the wavelength 
region to 4000 < A < 10000 A, while masking the Ha, 
[Nil], [Sn], H/3, and [Om] emission lines. We subtract 
the continuum model, thus correcting for the underlying 
Ha absorption. Next, we make an emission-line model, 
assuming a fixed ratio of the [Nil A6583]/Ha of 0.25, [Nil 
A6583]/[Nn A6548] of 3, and [Sn]/Ha of 0.3. We con- 
volve the model with the composite filter curves, scale it 
to fit the data, and derive the total EW of the blended 
Ha, [Nil], and [Sn] lines. 

Confidence intervals on Wh q + arc determined using 
Monte Carlo simulations. We simultaneously vary the 
photometric uncertainties (following the error on the 
mean, see Section [2]) and the continuum level. For the 
latter we assume an uncertainty of 1%. We also randomly 
vary the emission-line ratios in the simulations, assum- 
ing an uncertainty of 0.1 on both [Nil A6583]/Ha and 
[Sn]/Ha. However, as we derive the total EWs, the line 
ratios will barely affect the best-fit value or its confidence 
intervals. 

Figure [5] shows examples of the Ha emission line for 
seven composite SEDs, ordered by decreasing Wh«+- 
The consistency between the shape of the observed emis- 
sion lines and the convolved emission-line models is strik- 
ing. This result demonstrates the accuracy of our pho- 
tometric rcdshifts. In Figure [7] we assess the influence of 
different values for Az/(1 + z) on the shape and strength 
of the detected Ha emission line. The observed shapes 
give us a rough constraint on the photometric redshift 
uncertainties for certain SED types. The average ran- 
dom uncertainty is Az /(1+z) 5, 0.02, consist ent with the 
uncertainties found bv lWhitaker et all ()2011[ ) when com- 
paring photometric with spectroscopic redshifts. We can- 
not rule out catastrophic outliers, as they simply would 
increase the apparent variation among individual SEDs. 
Thus, the EWs of Ha might be slightly underestimated. 
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Table 2 

Stellar population properties, continued 
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Note. — We fit a grid of stellar population models assuming a delayed exponential SFH °^ tcxp(—t/r)) with logfi/yr) between 

7.6 and 10.1, i n step s of 0.05, log(r/yr) bet ween 7 and 10 in steps of 0.1, and Ay between and 3 in steps of 0.1 mag. We assume the 
ICalzetti et~al~l J2000fl attenuation curve and alSalpeteil d!955|1 I MF. For each model we explore three different metallicities: sub solar (0.01 
and 008 f or thelMarastonl $200?$) an d lBruzual & C hariot (2003]) models, respectively), solar, and supcrsolar (0.04 and 0.05 for the lMarastonl 
J2005D and lBruzual & C hariot (200.3) models, respectively). The errors represent the 68% confidence intervals. 
a The iteration in which the SED is constructed 
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Figure 6. Illustration of Ha measurements. The presented composite SEDs cover a wide range in Whc«+, and are ordered from the 
highest to the lowest Whci+ when going from left to right. The contin uum emission is fit locally, betw een 4000-10000 A , while masking Hf3, 
[Om], Ha, [Nil], and [Sn]. The red and orange c urves represent the Bruzual & Chariot (2003) and Maraston (2005) continuum models, 
respectively. In the middle panels we remove the [Bruzual & Cliarlotl l2003lT continuum model and fit the remaining emission line (red), 
taking into account the composite filter curves, as represented in the bottom panels. No redshift uncertainties are assumed when fitting the 
line. The consistency between the observed shapes and those predicted by the filter curves demonstrate the accuracy of our photometric 
redshifts (see Figure[7Jl. The middle panels also show indications for H/3 and/or [Om] at ~5000A in certain cases (dashed lines). 
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6000 7000 6000 7000 6000 7000 

A (A) 

Figure 7. Fits to Ha emission lines for different photometric red- 
shift uncertainties. The red curve is the best-fit model when as- 
suming that the photometric rcdshifts are perfect. For the orange, 
green, and blue models the random scatter A.z/(1 + z) is 0.02, 
0.04, and 0.06, respectively. This plot illustrates that the NMBS 
photometric redshifts are certain to Az/(1 + z) < 0.02. 

3.4. Rest-frame optical morphologies 

For each SED type we also show the morphologies of a 
subsamplc of the galaxies. We choose the nine galaxies 
that are closest in rcdshift to 0.9, as for these galaxies 
the available HST/ACS i-band imaging from COSMOS 
probes the rest-frame optical. The images are displayed 
in Figure 0] next to the composite SEDs. In this paper 
we will not give a quantitative analysis of the sizes and 
morphologies; this will be discussed in detail in a future 
paper. 

4. CORRELATIONS 

4.1. Star Formation History Indicators 

In the previous section we have directly measured 
two key indicators of present and past star formation 
in galaxies: Z)(4000), which is sensitive to the age of 
the stellar population (as well as metallicity) , and Ha, 
which measures the instantaneous SFR normalized to 
the amount of stellar continuum emission at 6563 A. To- 
gether, these parameters measure the star formation his- 
tory of a galaxy at different times, with D(4000) mostly 
sensitive to past star formation and Ha mostly sensitive 
to current star formation. Additionally, we derive the 
SSFR by comparing the full continuum shape with SPS 
models. The SSFR is the SFR divided by the stellar 
mass, and thus, similar to Wh q +, it is a measure of the 
current versus past star formation, though independently 
determined. 

In Figure [8] we compare the different parameters and 
show H / Ha+ versus both L>(4000) and the SSFR. Note 
that D(4000) and the modeled SSFRs are not completely 
independent, as both are derived from the continuum 
shape, and thus are not directly compared. Remark- 
ably, both panels show a tight correlation. Galaxies with 



weak breaks have high values for W / Ha+ and are best 
fit by stellar population models with high SSFR, while 
galaxies with low SSFRs and strong 4000 A breaks have 
low values for W / Ha+- This is a remarkable result given 
that the relations are derived without spectroscopy, and 
are solely based on medium-band photometry. Moreover, 
the Ha emission provides an independent measure of the 
stellar populations, and confirms our findings based on 
just the continuum shape. This is a reassuring result, 
as at the targeted redshift range Ha measurements are 
rare, and thus our understanding of galaxies is primarily 
based on photometric measurements, which use uncer- 
tain stellar population models to interpret the nature of 
the continuum emission. 

There are several caveats. Firstly, as indicated by 
the solid gray dots and dotted gray lines, the 4000 A 
breaks are corrected for resolution (see Section 3.2 and 
Appendix O). As the correction s facto rs are derived using 
the best-fit iBruzual &: Charlotl (|2003f ) models, they may 
be uncertain for galaxies which are poorly fit. Secondly, 
none of the values are corrected for photometric rcdshift 
uncertainties. Thus, depending on the exact redshift un- 
certainties, D(4000) and H^Ha-i- may have been slightly 
underestimated for certain SED types. For D(4000) this 
only affects the SED types with strong breaks. Thirdly, 
W / H Q + also includes line contributions from [Nil] and 
[S n] . Their relative contribution will be higher for metal 
rich systems or for cases in which an AGN is contribut- 
ing to the line emission. Thus, caution is required when 
interpreting this plot. 

Lastly, although Z?(4000) is measured over a narrow 
wavelength interval, dust obscuration slightly affects the 
break measurement, as indicated by the horizontal dust 
vector in Figure [8] Wh q + is also affected by dust, as 
the attenuation toward the continuum emission is gener- 
ally less than the extinction towards star-forming regions 
(vertical dust vector, Figure [8]). In order to illustrate 
possible offsets, the galaxies in Figure [5] are color coded 
according to their best- fit values for Ay. As expected, 
the most dusty galaxies, as indicated in orange, indeed 
have slightly stronger 4000 A breaks compared to less 
obscured galaxies with similar values for Whq+- How- 
ever, we do not correct for this, as the values of Ay are 
poorly constrained. 

4.2. Dust and Morphologies 

Taking the measured values for Ay at face value, we 
find that the oldest galaxies with large D(4000) and low 
W / Ha+ typically have lower values for Ay than the star- 
forming galaxies. Interestingly, the four youngest galaxy 
types with the lowest values for Z?(4000) and the highest 
SSFR (29, 30, 31 & 32) have similarly low extinction as 
galaxies with the strongest breaks. Thus, this may imply 
that star-forming galaxies are more dusty at intermediate 
SSFRs. 

We also qualitatively investigate how the morpholo- 
gies change with SED type. When ordering the images 
by the strength of D(4000), a few simple observations 
can be made. Firstly, the morphologies of the galaxies 
with the strongest 4000 A break are rather regular. This 
is also found in the loca l universe and at z ~ 2.3 (e.g., 
van Dokkum et all [20081 ) . Secondly, more than half of 
the displayed galaxies of the "oldest" SED type (1) have 
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Figure 8. Left: Correlation between WH a + and D(4000) for the 32 composite SEDs. The corresponding IDs arc printed in each symbol. 
The color coding of the symbols reflect their best-fit value for Ay, as indicated by the color bar in the bottom left. The triangles represent 
lcr upper limits. The gray dots represent the values for D(4000) which are uncorrected for the NMBS resolution (see Appendix B). They 
are connected to the corrected values by the dotted gray lines. The horizontal dust vectors indicates the total attenuation. The vertical 
dust vector indicates extra extinction towards Hn regions. Right: Correlation between W"Ha+ and the best-fit SSFR. This figure illustrates 
that IVjja+i D(4000), and the best-fit SSFR are strongly correlated: galaxies with shallow breaks have high values for M / Ha+ and are best 
fit by stellar population models with high SSFR, while older galaxies with low SSFRs and strong 4000 A breaks have low values for Wu a +- 



close neighbors, suggestive of mergers. The sample is 
small though, and this observation needs to be further 
assessed. Thirdly, almost all of the displayed galaxies 
with the highest dust content (e.g., 15 & 19) are edge- 
on disks. SED types with similar values for Wh<*+ and 
Z?(4000) as the edge-on disks, but with a lower dust con- 
tent (e.g., 16 & 18) contain several face-on disks, which 
are likely the same class of objects viewed from a dif- 
ferent angle. When going to lower values of D(4000) 
and higher values of Wiia-H more galaxies have irreg- 
ular and clumpy mo rphologies. This is consistent with 
what is found at low (IQverzier etll|[2008h or even higher 
redsh ift (e.g-. lKriek et al.ll2009bt iForster Schreiber et al.l 
120111) . A more quantitative analysis will be presented in 
a future paper. 

5. COMPARISON TO LOW- REDSHIFT GALAXIES AND SPS 
MODEL PREDICTIONS 

5.1. Comparison to the Sloan Digital Sky Survey 

In this section we compare our measure ments with 
those of low-redshift galaxies from the SDSS (jYork et al.1 
|2000|) . For the SDSS galaxies we use the spectro- 
scopic ineasuremejrtsfrojn_the MPA-JHU DR7 release 
(e.g.. iKauffmann et all l2003al iBrinchmann et al.l [200l 
iTremonti et al.ll2004ft . In Figure IH1 we show the 0.5 < z < 
2.0 correlations between Wh q + and D(4000) in compar- 
ison to the SDSS galaxies. No similarly derived SSFRs 
are available for the galaxies in the SDSS and thus they 
are not shown in the right panel. The sizes of the symbols 
in Figure [9] reflect the number of galaxies per composite 
SED. 



The relation between Wh q + and D(4000) for the galax- 
ies at 0.5 < z < 2.0 is similar to the low-redshift SDSS 
galaxies. There may be a slight offset to higher values 
of Who+ for the z ~ 1 galaxies. This possible offset is 
likely not caused by systematic errors. Photometric red- 
shift uncertainties would move the 0.5 < z < 2.0 relation 
in the opposite direction, and contamination by an [0 11] 
emission line would make the break at z ~ 1 even weaker. 

We investigate possible physical explanations by com- 
paring the observations with different model predictions. 
In Figure IH1 we show tracks for differen t SFH s, which 
are derived from the IBruzual &: Chariot! ((2003ft models 
in combin ation with the rel ation between SFR and Ha 
as given bv lKennicuttl(fT998]) . The SFHs arc visualized in 
the inset in the right panel in Figure [9l For subsolar, so- 
lar, and supersolar metallicity we assume (N H+S 11) /Ha 
of 0. 23, 0.8, and 0.8 respectively (e.g., IDenicolo et al.l 
120021) . Comparison with model tracks suggests that 
galaxies at z ~ 1 may have SFHs with longer timescales 
or higher metallicity compared to z — galaxies. 

The slightly different loci of the SDSS galaxies and 
our composite SED sample may also be due to system- 
atic differences in the reddening toward H 11 regions. For 
example, star-forming regions may be relatively more ob- 
scured at low redshift. Higher black- hole accretion rates 
at earlier times could also explain the offset location of 
certain SED types at 0.5 < z < 2.0. This is further 
supported in the right panel of Figure |H1 as the loca- 
tion of several composite SEDs cannot be explained by 
any SFH. In this context it is interesting to note that in 
the SDSS many massive, quiescent galaxies are known 
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Figure 9. Left: The EW of Ho vs the strength of the 4000 A break for the 32 SED types presented in this work (solid circles) compared 
to low-redshift galaxies in the SDSS (blocked gray scale) and different SFHs (colored lines). We explore models with SFHs with gradual 
declining SFRs (red) and contrast those with SFHs with abrupt declines (yellow, green, and purple). The shapes of the SFHs are visualized 
in the inset in the right panel. Subsolar, solar, and supersolar models are represented by the dotted, solid, and dashed lines, respectively. 
The sizes of the solid circles scale with the number of galaxies per SED type. For c larity, error-bars are not shown in this figure (see 
Figure©. Right: The EW of Ho vs. the best-fit modeled specific SFR (assuming the IBruzual fc Chariot! (2003ft models) for the 32 SED 
types in comparison to different SFHs. Symbols and lines are similar to the left panel. No similarly derived SSFRs are available for the 
galaxies in the SDSS. Models with abruptly declining SFRs predict that evolved galaxies at low to intermediate 15(4000) and SSFRs have 
low values for Wn a + (yellow, green, and purple). Since the bulk of the galaxies do lie well above those curves, we conclude that these 
models do not describe the majority of the galaxies. Thus, the suppression of star formation at z < 2 is generally a gradual process. 



to host LINERs (e.g- lKauffmann et alJl2003bf >. and thus 
the M^hq+ °f the quiescent galaxies in SDSS likely do 
not reflect their star formation activity. Follow-up spec- 
troscopy is needed to study the nature and origin of the 
line emission. 

5.2. Implication for Star Formation Histories 

It is striking that -D(4000) and Wh q + correlate so well, 
given that W / hq+ is mostly sensitive to current star for- 
mation, while D(4000) primarily reflects the past star 
formation. As shown by the various model tracks in the 
left panel of Figure [9j galaxies could in principle occupy 
roughly all space in between the yellow solid and red 
dashed line. However, we find that the composite SEDs 
and the individual SDSS galaxies span a tight relation, 
close to the solid purple or dotted red line at higher 
values of Wh q + and close to the red solid line for low 
values of Who+- Thus, star- forming galaxies seem bet- 
ter explained by short star formation timescales or lower 
metallicities, while the more quiescent galaxies are bet- 
ter explained by SFHs with exponential decaying times 
of about 1 Gyr and solar metalliticy. 

This finding suggests that the more quiescent galaxies 
contain stars over a wide age range and that their star 
formation is gradually declining. To further illustrate 
this point, we have added a truncated model to Figure [SJ 
This model has a constant SFR for about 1 Gyr, after 
which the star formation declines to zero. The model is 
convolved by a Gaussian with a a of 50 Myr. Thus, the 



decline occurs over roughly 0.2 Gyr. This model predicts 
an abrupt decline of Wh q + compared to the evolution of 
.0(4000), and therefore a population of galaxies with very 
weak or absent Ha emission and weak 4000 A breaks. 
However, of all composite SEDs with D(4000) < 0.24 
all have detected Ha emission with a minimum value 
of 17.7 A. 

We can further quantify this argument by consider- 
ing the slow suppression model in red and the fast sup- 
pression model in green. For each model we derive how 
long it will take for the 4000 A break to increase from 
log £>(4000)= 0.16 to log L>(4000) = 0.24. For the two 
models this is 1.56 and 0.78 Gyr, respectively. Thus, if 
only these two SFHs would exist and both star forma- 
tion timescales were equally common, we would find that 
33% of the galaxies with log D(4000) = [0.16, 0.24] would 
have undetected Ha emission. However, we find that 
out of 778 galaxies with 0.16 <log Z?(4000)< 0.24, none 
have detected Ha lower than 17.7 A. The non-existence of 
these composite SEDs implies that SFHs with very short 
star-forming and quenching timescales are quite uncom- 
mon. However, short timescales may be possible for the 
quiescent galaxies that have no detected Ha emission 
(upper limits), if there quenching occurred at z > 2. 

We note that AGNs may complicate this argument. As 
we have discussed above, the line emission, in particular 
for galaxies with strong 4000 A breaks (both at z ~ and 
z ~ 1) may originate from actively-accreting black holes. 
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Although it is unlikely that the line emission in all these 
galaxy types is dominated by AGNs, caution is required 
in the interpretation, and follow-up spectroscopic studies 
are needed to uncover the origin of the ionized gas. 

6. SUMMARY AND CONCLUSIONS 

In this paper we study the Ha emission line, the 4000 A 
break, and the SSFRs of ~3500 galaxies at 0.5 < z < 2.0, 
using the NMBS high-quality medium-band photometry 
in the COSMOS field. The optical and NIR medium 
bands are complemented by UV-to-IR broadband pho- 
tometry, leading to a total of 33 bands for each galaxy. 
We have identified analogous galaxies using 22 artificial 
rest-frame filters which are evenly spaced in log A be- 
tween 1200 and 50 000 A, resulting in 32 galaxy subsam- 
ples. Hence, our partitioning method is independent of 
stellar population models or other assumptions. For each 
subsample we constructed a composite SED by scaling 
and dc-rcdshifting the observed photometry. 

The composite SEDs are of spectroscopic quality 
and show spectral features, such as the (blended) 
Ha+[Nli]+[Sn] and H/3+[0iii] emission lines, the 
Balmer or 4000 A breaks, Mg II absorption at 2800 A, the 
continuum break at 2640 A, and the dust absorption fea- 
ture at 2175 A. Most of these features are not detected in 
the SEDs of individual galaxies. Thus, the increased S/N 
and sampling of the composite SEDs enable the detection 
and measurement of spectral features, which would nor- 
mally require spectroscopic data. Therefore these SEDs 
can be used to study the stellar populations and emission 
line characteristics of typical galaxies in a very detailed 
way, which is not possible when the photometry of in- 
dividual galaxies is considered separately. Additionally, 
the shape and lincwidth of the Ha emission line yield an 
indepedent measure of the photometric redshift errors. 
Consistent with comparison to spectroscopic redshifts, 
we find an accuracy of Az < 0.02 x (1 + z). 

In the present work, we measure Wna+ and D(4000) 
for all composite SEDs and compare the full SED shapes 
with S PS models . Mos t SE D types are equally well fit 
by the IMarastonl (|2005l) and iBruzual k Charlotl (|2003f ) 
SPS models. The exception, however, are po st-starburst 
galaxi es. A s has prev i ously been shown in iKriek et al.l 
<|2010T l , the IMarastonl (|2005| ) models cannot reproduce 
the full SED shape of post-starburst galaxies simulta- 
neously. They overpredict the NIR luminosity, imply- 
ing that these models give too much weight to TP-AGB 
stars. 

We find that Wn a +, £>(4000) and the best-fit SSFR 
are all strongly correlated, such that galaxies with shal- 
low breaks have high values for W / Ha+ and are best fit 
by stellar population models with high SSFRs. On the 
other hand, galaxies with strong 4000 A breaks have low 
values for Wka-i- an d are best fit by SPS models with 
low SSFRs. This is a remarkable result given that the 
relations are derived without spectroscopy, and solely 
based on medium-band photometry. Moreover, the Ha 
lines provide an independent measure of the stellar pop- 
ulations and confirm the results based on just the con- 
tinuum shape. Ha measurements at the targeted red- 
shift range are very challenging with spectroscopy, as 
the line is shifted to NIR wavelengths. At this mo- 
ment, spectroscopic Ha measurements beyond z = 0.5 



are o nly availably for small or biased galaxy samples 



(e.g.. iShaplev et al.l 12001; lErb et al.l l2006d: IKriek et al.l 
I2008bl: iForster Schreiber et al.l 120091) . and consquently 



our understanding of galaxies at this epoch is primarily 
based on photometric measurements, which use uncer- 
tain stellar population models to interpret the nature of 
the continuum emission. The composite SEDs - for the 
first time - open up the possibility to efficiently study 
Ha emission for large, magnitude-limited galaxy sam- 
ples, providing a crucial assessment of our photometric 
studies. 

We also study the rest-frame optical morphologies of 
a subsample of galaxies around z ~ 1 for each SED 
type. Interestingly, we find that with increasing D(4000), 
galaxies appear to change from irregular and clumpy, to 
fairly regular disks, to regular early-type systems at the 
highest D(4000). Strikingly, the most dusty galaxies are 
typically edge-on disks. 

The relation between Wu a + and D(4000) at 0.5 < z < 
2.0 is similar to the low-redshift relation, as determined 
from the SDSS galaxy sample. There may be a slight 
offset, such that at fixed value for D(4000), the Wh«+ of 
the 0.5 < z < 2.0 galaxies is slightly higher than for the 
low-redshift galaxies. We use simple models to interpret 
this difference, and show that the rate at which the SFR 
is declining can explain the difference in Wh q + at fixed 
Z?(4000); specifically, the SFRs in z ~ 1 galaxies may de- 
cline more gradually than at low redshift. The elevated 
1^0+ may also be explained by higher black hole accre- 
tion rates or higher metallicities at earlier times, or by 
relatively more obscured Hn regions at lower redshift. 

It is remarkable that W / h q + and D(4000) are so tightly 
correlated, given that WHa+ is mostly sensitive to cur- 
rent star formation and Z?(4000) is a measure of the older 
stellar population, and thus the past star formation. SPS 
model predictions show that SFHs that decline rapidly 
after their major burst - such as truncated SFHs or 
SFHs with short timescales - predict a steeper decline of 
W / Ha+ compared to J5(4000) than what is observed for 
the galaxies with intermediate to low values of Who+- 
The combination of D(4000) and W / Ha+ and the best-fit 
star formation timescales both indicate long star forma- 
tion timescales of ~ 1 Gyr (for a SFH parametrized by 
iexp(— t/r)) for older galaxy types. This suggests that 
the suppression of star formation at z < 2 is generally 
not an abrupt, but rather a gradual process. Our find- 
ing is consistent with the predictions by high-resolution 
smoothed particle hydrodynamics (SPH) simulations of 
massive ell iptical galaxies st arting from ACDM initial 
conditions (|Naab et al.ll2007t) . 

The SEDs that we presented here can be used for 
many additional purposes. For example, we can mea- 
sure the relative strength of emission lines, the strength 
of absorption features (e.g., Mgn at 2800 A), and con- 
strain the dust extinction curve including the 2175 A 
dust absorption feature. This paper briefly discussed 
the morphologies of the different SED types. A more 
quantitative analysis, with measurements of sizes and 
cllipticities will be presented in a future paper. For 
example, by comparing the sizes of galaxies in sev- 
eral evolutionary phases, we can assess theories pro- 
posed to_exrjkaiii_Uie_siz 

(e.g., Ivan Dokkum et al.l 120081 iNaab et al.1 [2007L [200l 
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iBezanson et"al|[200l iHopkins et al.l [20091 . [20Toh . Com- 
parison with other wavelength regimes also offers many 
interesting possibilities. For instance, X-ray stacks al- 
low the study of AGN demographics among the different 
types. 

Due to the low resolution of the NMBS, it is unfor- 
tunately not possible to deblend the emission lines Ha, 
[Nil], and [Sn] or H/3 and [Oni]. Thus, NIR spec- 
troscopy is needed to test our Ha measurements, to de- 
rive emission line ratios, and to study the contribution 
from AGNs to the emission line fluxes. Such measure- 
ments would also be very valuable to study the metallici- 
ties and Balmer decrements within and among the differ- 
ent samples. Altogether, our composite SEDs in combi- 
nation with follow-up spectroscopy and other wavelength 
data enable many different studies to better understand 
the origin and evolution of galaxies. 

We thank the members of the NMBS team for their 
help with the observations and construction of the cata- 
logs, Charlie Conroy, Ryan Foley, and Jeremiah Ostrikcr 
for useful discussions, and the COSMOS and AEGIS 
teams for the release of high-quality multi-wavelength 
data sets to the community. M. K. acknowledges sup- 
port of a Clay Fellowship administered by the Smithso- 
nian Astrophysical Observatory. 
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Figure Al. Composite SEDs of the remaining 17% of the parent sample, that were not included in the original 32 galaxy composite SEDs 
in Figure [4] The symbols and lines are the same as in Figure [4] The remaining galaxies are binned by increasing the value of b in Equation 
(1), until less than 20 galaxies are left. Most SED types look similar to ones in the initial samples, though with more scatter. However, 
there are several types which are different, such as SED 42, 44 and 48. Thus, the most un-obscured star-forming galaxies are likely missed 
in our primary sample. 



York, D. G. et al. 2000, AJ, 120, 1579 

APPENDIX 
APPENDIX A: COMPLETENESS 

The composite SED collection of 32 template constitutes 83% of the —4200 galaxies in our parent sample. Here, we 
assess the completeness and discuss which SED types we are possibly missing. For the 17% of the remaining galaxies 
we use the same technique to find analogous galaxies, but now taking b = 0.1 in Equation (1). We divide up the 
sample, such that each primary has at least 19 analog galaxies and construct 12 composite SEDs with this value of b. 
They are presented in Figure [XT] (33-44). Next we increase b to 0.15, 0.20, 0.25 and 0.30 until the rem ainin g sample 
is less than 20 galaxies. The six composite SEDs constructed with values of b > 0.1 are shown in Figure [All as well. 

Many SED types in Figure ETl look similar to one of the 32 galaxy types in the main sample. However, SEDs 42, 44 
and 48 are not represented in the 32 galaxy types. Thus, the most un-obscured star-forming galaxies are likely missed 
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Figure A2. Bottom: The value of the correlation quantity b (see Equation 1) between the primary galaxies of the different subsamplcs. 
Top: the minimum value of b for the primary galaxy of a certain SED types, when compared to all other primaries in the main sample 
(first 32 SEDs). 

in our primary sample. Their SSFRs are in the order of log SSFR = [-7,-8]. However, as their Balmer breaks are very 
shallow and as their Lyman breaks are not in the observed window for 0.5 < z < 2.0, their photometric redshifts have 
large uncertainties. Thus, the measured M / hq+ for these galaxies is highly uncertain. Galaxy types with high values 
for b give less insight into the remaining galaxy population, as they are likely composed of intrinsically different SED 
types. 

To do a more quantitative analysis, we relate all primary galaxies with the primaries of the main sample in Figure lA"2l 
In the bottom panel we indicate the values for b (see Equation 1) between the different SED types. In the top panel 
we show the minimum value of b for each SED type, when compared to all other primary galaxies of the main sample. 
This figure illustrates that our main sample is well sampled, with typical b values of ^0.05-0.06 between the closest 
types. SED t ypes 6 and 24 have slightly higher values with b ~ 0.08. The SED types in the b = 0.1 sample (SED type 
33-44, Figure |A"T|) relate to the main sample with values of b ~ 0.06. The exceptions are SED types 42 and 44. As 
mentioned above, these galaxies are indeed not represented in our main sample. The SED types with higher values 
of b are less correlated to the primaries of the main samples. For some types (e.g., 48) this means that their SED is 
very different from the main sample, while for other types this may reflect the fact that the large variation of galaxies 
within one type leads to unphysical composite SEDs. 

APPENDIX B: STACKING TEST 

In this appendix we assess whether our stacking technique indeed yields the average properties of the individual 
galaxies in the subsamples. We do so by applying our stacking technique to galaxy spectra in the SDSS. We start with 
10 000 galaxy spectra in the SDSS and select all galaxies at 0.095 < z < 1.05. For each galaxy we determine the fluxes 
in the rest- frame filters (shown in Figure [TJ), which overlap with the spectra. Next, we correlate all galaxies with each 
other using Equation 1. As the galaxies are matched using only three filters, we start with a smaller value for b of 0.01 
to find analogs. Similar as for the higher redshift sample, we increase the value for b, requiring that each SED type 
consists of a minimum of 20 galaxies. We increase b until nearly all galaxies are part of an SED bin. In total we have 
26 subsamples for which we construct the composite SEDs. 

We measure £>(4000) in the composite spectra and compare these to the mean of the individual measurements in 
Figure [Bl] The individual measurements are adopted from the MPA-JHU DR7 catalogs. The color of the datapoints 
reflect the value of b, and thus the scatter of the properties within the subsamplcs. The errorbars on the mean 
properties of the individual galaxies are derived using bootstrapping. We also calculate the errors on the stacked 
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Figure Bl. Comparison of the average values of individual galaxies with the value as measured from the stacked spectrum of the same 
individual galaxies, for both Whq+ (left) and D(4000) (middle). For this test we use ~1000 galaxies in SDSS at z ~ 0.1, and construct 
the stacked spectra in the same way as for the higher redshift sample. The right panel shows 1Vho+ vs. D(4000), with the individual 
measurements represented by the small gray dots and the stacked values represented by the solid circles. The sizes of the solid circles scale 
with the number of galaxies in the stack. The colors of the symbols indicate the value for b (see Equation 1) of the stacked spectrum. This 
figure illustrates that with our stacking method we recover the average properties of the individual galaxies. 




Figure CI. Left: Measurements for Z?(4000) derived from the best-fit Bruzual & Chariot (2003) models to the composite SED at the 
original resolution versus D(4000) measured after convolving the best-fit model with the composite filter curves. For each composite SED 
we derive a resolution correction factor based on the ratio of these two D(4000) measurements. The correction factor depends on the 
strength of the break and the composite filter curves. The dashed lines represent the correlations for two particular filter sets. Right: 
The effect of photometric redshift errors on the break measurement. We do not correct for this effect, as the exact redshift uncertainties 
for each SED type are unknown. Nonetheless, with this exercise we caution that the measurements for D(4000) for larger breaks may be 
underestimated. 

values using simulated spectra, which arc constructed using the noise spectra. 

There is good agreement between the break measurements for D(4000) for the stack and the individual systems, 
with a small systematic offset toward higher values for the stack. The different measurements for Whch-i- are also in 
very good agreement, except for at the lowest value of Wh q + of 2-3 A. This is reassuring, in particular given the fact 
that both measurements use different techniques to correct for the underlying Balmer absorption. The agreement does 
not worsen for larger values of b; even for composite spectra which are constructed from galaxies which exhibit a large 
intrinsic scatter in SEDs, the stacks still recover the average of the individual measurements. 

APPENDIX C: D(4000) CORRECTION 

The composite SEDs allow the direct measurement of the 4000 A break. In this paper we use the index D(4000) 
by iBruzuaJ ([1983) to quantify its strength. This index measures the difference in flux in small wavelength regions 
bracketing the break, as indicated by the orange shaded areas in the inset in Figure IC11 However, this measurement 
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will by affected by the low resolution of the composite SED. The inset in Figure IC II shows a synthetic SED in the 
region around the break. The red curve shows this same SED, but now convolved by the composite filter curves. This 
figure clearly illustrates that for this case the convolved SED will yield a slightly lower value for _D(4000) than the 
actual SED. 

We can correct for this effect as we know the composite filter curves for all SEDs. For each SED type we derive a 
correction factor using its best-fit stellar population model. First, we measure the break in the original best fit, and 
then we derive £>(4000) for the same model , bu t now convolved with the composite filter curves. We show both these 
measurements for each SED type in Figure [UTl We obtain the correction factor by dividing them by each other, and 
we assume a 5% error on this factor. 

The correction fac tor i s dependent on the actual value of £>(4000) and the sampling of the composite filter curves. 
For example, Figure ICll shows that for small breaks the correction factors are close to unity. This is not surprising, 
as for these galaxies D(4000) measures the slope of the continuum, and this is less affected by resolution. However, 
the larger the break, the larger the correction factor. The dependence on the filter curves is illustrated using the 
dashed colored lines. These lines indicate the correction for different values of Z?(4000) for a particular composite filter 
set. For SEDs with less composite filters the corrections are larger. Note that the break me asurement is not always 
consistent with that of the best-fit model, and thus the distribution of D(4000) in Figure ICll is slightly different from 
the measured distribution. 

The break measurements are also affected by photometric redshift uncertainties. In the right panel in Figure ICll 
we show the effects on the break measurements for different values of Az/(1 + z). The widths of the blended Ha 
emission lines (see Figure [7]) indicate a photometric redshift uncertainty of Az/(1 + z) < 0.02. Comparison with 
spectroscop ic redshifts shows th at the photometric redshifts in the NMBS have an uncer tainty of Az/(1 + z) = 
0.01 - 0.02 (jWhitaker et alJl2011[ ). However, the uncertainties will depend on SED type (e.g.. lKriek et al.ll2008al ). and 
as the exact scatter per SED type is unknown, we have decided not to correct the break measurements for photometric 
redshift uncertainties. This means that the values for D(4000), in particular for galaxies with strong breaks may be 
underestimated. 



